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universality of (p*2? Tag systems 
and a h. symbol 7 state Univers al Turing Machine 

M, L. Minaky 

§0. Introduction. 

This report describes (1) an improvement and great simplifica- 
tion of the proof that the "Tag" systems of Post can represent any 
computable process, and (2) a Universal Turing machine with Just 
four symbols and seven states— the smallest yet reported. 

part §1 of the report gives a very simple proof of the baalo 
theorem II of ref* [1], that any Turing machine has a representa- 
tion as a Tag system. The result is that we obtain systems with 
deletion number p * 2 (see U below), an improvement over the re- 
suit p - 6 in ref. [1] . The simplification over the complicated 
methods of [1] la due In part to a better representation of the 
Turing machine, suggested by Dana Scott, and in part to a system 
of Post productions which yields the p * 2 result directly . The 
result of §1 Is due Jointly to John Cooke and the writer., 

The simpler structure of the production rules enabled us to 
simplify both the structure of, and the encoding for, the 6-state, 
6-symbol Universal Turing machine reported in [2] . This is des- 
cribed in part §2 of the report. 

Part §3 Is Just a note simplifying (?) the basis for recursive 

functions developed in [1]. 
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H. Tuvln g Machlnea and p » 2 Tag ay a tenia. 

An instantaneous description of a Turing machine computation 
includes the contents of the Tape, the location of the maohine on 
the tape, and the machine's internal state. The situation can be 
represented as 



a l 



*i 



ZZZ3 



Following a suggestion of Dana Scott, we can regard the des- 
cription as composed of four numbers; a, q^, m * £ a^2 , and 
n « T b,2 l , We assume that the Maohine is binary, so that the 
a,** and b,«s are all ones and zeros. All but a finite number of 
the a.* a rind b,*s are zero, so the summations are defined. 

A Turing machine T is a set of quintuples <l 1 »»j s < kj» 8 ij» d lj ° 
Suppose that d,. happens to mean "move right". Then this quintuple 
has the following effect on {a, q i ,m,n} 

1. change m to 2m + s^. 

2. if n •«■ 2p + b Q , change n to p and remember 
is the next symbol to be read. P is [j] . 

3. control is transferred to the quintuple starting with 

That IS 



b Q which 




(q 18 s^,m,n} -► {«ij»n-2|§j ,2m + s^, jjjj 



where [xj is the largest integer in x . If d^ is "move left" 
then the roles of m and n are interchanged. 
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The Turing Machine is thus equivalent to a programmed computer 
which executes Instructions I, . which perform the above operation 
on m and n • Each instruction is followed by transfer of control 
conditional on the value (0 or 1) of the symbol read in phase 2 
above m This computer has 2 (Infinite) data registers, for m and 
n * and an instruction location register to remember the pair q***. 

It is quite easy to represent this computer in a "hi-deletion" 
(p » 2) Post Tag system* See [1] . The system operates on a string 
of symbols which contains unary representations of the numbers m 
and n The instruction location Is represented by the choice of 
letters composing that string* For each instruction we will employ 
a different alphabet, distinguished by subscripts « 

At the start of the k-th instruction I k corresponding to 
{q^Sj} the working string will have the form 

where the J*s represent symbols whloh can never affect the process » 
An operation of phase (1) is represented by Tag productions of the 
form (P ■ 2) 

A k Xfc ^ y k 

I a i 4 4r 

so that 



A k I(^I) m BjRyJp n - A^D^-iJ B^)* . 

Note that we have already begun the division of n by 2o This is 
the beginning of phase (l)o 
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The outoome of phase (2) has to depend on whether n was even 
or odd, I.e., on Whether the new symbol b Q is aero or one. Suppose 
that for n even we are to tranafer to the k -th Instruction and 
for n odd to the kj-th instruction. Then 
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Is a set of productions whioh does what we went. To oheok this we 
traoe through two oases; for simplicity let the Sjj assoelated 
with this quintuple be sero. If n is odd then n » 2p + 1: 

A*** Bl(yi)^* 1 

Since the string is composed of even-length blocks, the double- 
primed letters oan never affeot the proceaa and the string is equi« 



and control is transferred to the productions governing the 
lu-th alphabet o 

For even n 9 we have 

and the correct new values of m and n are delivered to the pro- 
ductions governing the k Q -th al P nabet » 

The representation of a Turing machine as a 2-register computer 
is a variant of that in Dana Scott's sketch^ 'of an elegant proof 
of Theorem I of [1]. In that paper we used a kind of second-order 
ari thine tizatlon so that the Turing machine condition could be repre- 
sented in the prime factorization of a single number, John Cooke 
suggested that one ought to be able to manipulate the two quantities 
of theorem la of [1] directly B and by using the "phase-shifting" de- 
vice buried in the lower left of the present production diagram he 
and I were able to reduce the deletion number to 2, as shown above a 
Putting these ideas together eliminated the two major complications 
of [1], namely the factorization in Sec, 1,2 and the division algorithm 
in Sec, 2,1 of [1], The demonstration in [1], that there are univer- 
sal two-tape non-writing machines, can be reconstructed easily from 
the above. 



(1) Observe that eaoh (qjS.) •+ 12 productions, and by sane merging, 
each <l* ~* 13 prods, J 

(2) Personal communication. 
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- ^ir* o universal machine If 
rf- follows from »1 that •• oan ■*• • TO1T 

aimule*. th. application of an arbitrary Tag ays .am with 
we can simulate too »ff differ- 

h™. 2 to an arbitrary sequence of symbols . The differ 
deletion number 2 to an ar^ •> 

* a »* different blocks made up of the 
^ _wi. will be represented by dirreren^ 
ent symbols will be r p ^ ^ 

four symbols available to our machine. The tape 
will be divided into three regions: 



| rules region 



erased region 



— v . 

sequence region ^ 



, ^ i t and Bo will represent the 
we wll l u.a four symbo . 0. ^ »-_ ^ ^ ^^ ^ 

blank ., u .r.s o -;- -J^.. ^ and the wording 

zzzz:l~~ - - — - - - *- in 

the "sequence region" . 

tot *. letter, of the alphabet of th. T. g ay-am of 
A and lot th. promotion rula. nava th. form 
X "" " . , (From th. oonatmotlon In ll -a Know that 
tAj-*jl*J2— A 3r(J)'- rep ra.ant 

,-.(«<<' but we do not use this fsot.) 

1 £ r(J) <* follows, 

these in the "rules, region" as strings of 

Define j„l j-1 

3 i>i i- 1 

How for each letter Aj we -i ^_ 

•I 



where O n means n zeros* We concatenate these In descending order 
to form the representation of the rules: 

R m R n R n-l° #oR l ° 

When a symbol Is encountered In the sequence region, It will be ne« 
cessary for the Turing machine to locate, in the rules region, the 
corresponding rule* The use of the representation XT for a symbol 
makes this location easy, for the value of 77 gives (almost) the 
number of l f s that the machine must traverse to reach the appropri- 
ate rule (moving from the right) The double 1 1 at the left marks 
the end of the rule*, The counting process used in locating the 
rule does not count the left one of this pair so that It sees just 
r(i) + 1 ones In the counting * The symbol has one extra because 
the copying process which places the new word at the end of the se- 
quence region skips the first of each symbol copied 

Finally we must represent the working string in the sequence 
regiono This Is marked as a string of X f s and B»s* If the sequence 

is At,. A v oooA^ then it Is represented in the sequence region* 
K l *2 *s 

from left to right now, as ^^^ mmmmmmm 

1 2 s 

S ^ X BX Boo oBX © 

The symbol fa^ will have ^ « 1 and ?\J - 1 will be Op 
This symbol must not be used In regular operation (because it will 
cause 2 consecutive B f s In region s) but, as seen below, can be 
used as a halting symbol with the production an-* ^hfiiSi « 

Finally the "erased region 1 *, where initial parts of the working 
sequence have been deleted (we do not shift the sequence to the left 
between applications of the rules), is Just an Interval of f s (blanks) 
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So the entire tap* has the form 



• • • 



tet 



• 9 



4y° 



and the maohlne* a operation begins with the reeding head centered 
on the first mark (an X) at the left of the sequenoe (S). 
Example of encoding t 

Suppose the productions are 

•1^*1 «1 *1 
a 2 - a 2 a 2 

Then r(l) - 3 » r{2) « 2 , r(3) - % , 
•J. •' X • 



» 2 "5 » «3 • 8 , 



and 



R x - 11010101 



Bg * 110*10 5 1 

R 3 - 110 8 10 5 10 8 10 8 1 . 

If the starting sequence is a 2 a 2 a 2 a 5 9 the machine *s initial tape 
will be 



3. 000 lllO^O^lO^O 6 !! 110^10^1 
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Operation of the Universal Turing Machine. 

When started, the machine moves back end forth oounting X»s to 

the right and l«s to the left. When It enoounters the first B to 

the right It will have erased the first symbol A k of S, and It 

1 
will have loc ated to the left the rule R,_ associated with that 
"l 

symbol o It then goes Into another back-and- forth mode which copies 
the rule word at the extreme right of S, It works from the Inside 
out, which Is why the sequenoes are written left to right In S but 
right to left In R. Each In an R yields an X In S, and each 1 In 
R yields a B in S, (When the latter occurs, an extra Is erased 
In R, accounting for the difference of one zero In the representa- 
tions on the two sides.) 

When the »11» Is encountered in R, copying stops and the machine 
restores the tape to Its initial configuration, removing the place 
markers set up in the locate and copy phases. This restitution pro- 
cess, almost by magic, erases the next word in the sequence region S, 
thus executing the process with deletion number 2. In the operation 
of the machine, one will note some manipulations exchanging l's and 
B»8; these are tricks by which we mark looations without using new 
symbols. 

Below is the state transition table for the machine. The tri- 
ples associated with each symbol-state pair are: f symbol wrltten c 
direction of motion , and new state (if different)]. The machine 
starts in state 1 on the first X in region (S). 
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The following sequence demonstrates most features of the 
machine* a operation* 



START 



® 



B 



I 4 i 

q ? X B XX B 



- S ■♦ 



This sequence contains one genuine rule string - 110010001 - which 
when copied produces the sequence BXXBX • The two l»s in the 
center of the example are skipped over in reading, and are there 
to illustrate how the rule is looated— precisely by skipping over 
two l»s. When the string is copied, the original is erased, so 
this example keeps cycling and never halts. 
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The Halting Condition 

The machine will halt If It is In a tat* q^ and encounters 
a . this will occur If there are two oonseoutlve B's In S 
encountered In the locating phase. We can define a letter Ag 
In the alphabet of the formal system to be a Halting symbol which 
atops the generation of aequenoes (e.g., by providing no rule for 
operating on It), In the Turing machine representation this is 
easily represented by aa signing to the symbol Ag the R-sequenoc 

110 10 1 

which will oauae two B'a to be written oonaeoutlvely at the end 
of S . When the machine enoounters this it will stop. It is a 
little hard to inaure that this will be hit in the correct state, 
so that it may be simpler to guarantee this by writing 3 B»a, 
• •g»t by using for Aj, the sequence 1101010 1. For this 
we should take r(H) ■ 3 • It la convenient to let A H be aj^ . 



For a general diaoussion of amall Universal Machines, see [2] • 
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$3o The Reduction of Machine Computations to Programs Involving 
Two Arithmetic Instructions Operators on Two Registers o 

Theorem I of (1) states that 

we can represent any partial recursive function T(n) by 
a program which operates on two integer variables S^ and 
Spl this program is composed entirely of instructions of 
the four types 

(Al) Add 1 to S 1 c Go to Ij# 
(A2) Add 1 to S 2 o Go to I*. 
(Bl) If S, > 0, subtract 1 from S-. and 

go to 1^5 otherwise go to I^* 
(B2) If S~ > 0, subtract 1 from Sg and 
go to I^j otherwise go to I -go 

By a partial recursive function we mean, of course, a mapping 
from some integers to other integers defined by a Turing machine 
together with an Interpretation of some of its tape configurations, 
as representations of Integers* If T(n) is such a partial function 
the cited theorem states that there is such a program which , if 
started at an initial Instruction I* with S 1 set at 2 n and 
S 2 set at , then (If T(n) is defined) the program will halt 
at a terminal Instruction I with S^ * 2 * n * and S 2 • • 
(•Hie question Is open, and presumably negative, as to whether one 
can do better with programs of such instructions so as to eliminate 
the exponent in the representation o) 

The theorem cited above concerns programs made of four instruction 
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types* We can reduce this to three » in effect by replacing 
types A2 and B2 by an exchange operation which switches S^ 
and S 2 * We oan reduce the basis to two by merging the exchange 
operation with Al and Bl as follows 

We define new add and conditional subtract instructions: 

(A) Add 1 to S , exchange S^ and Sg* and go to. I «• 

(B) If S, > 0, subtract 1 from S*, exchange S^ and 
Spt and go to I^j otherwise exchange S, and Sp 



and go to I 



J2° 



Now consider the effeet of a linear sequence (subroutine) of the 

fOOT 

A-»A-»A-»B3lj« 

This will add 1 to S,, then add 1 to S~, then add another 1 to S^ 
and then subtract 1 from Sp« The latter subtraction must occur 
since the second add assures us that S 2 will not be zero at the 
time of the fourth operation • Thus the net result amounts to an 
operation of the form of the original Al, except that we have 
added 2 to S^ Instead of just 1* 

Similarly, it can be seen that the subroutine AABA 

A-*A-*B"*A-*I, 

has the effect of A2 9 again with an increment of 2 instead of lo 
Now consider the sequence 



- B 5 B * bC j1 



X J2 
^ This leaves S* invariant In any case We make both branches from 

the second instruction go to the third* Then If Sp Is zero the 



program exits from the second branch without changing anything. 
If S 2 > 2 then S 2 i» decremented by 2 and the prgram leaves by the 
first branch. So unless Sg happens to equal 1 , this routine is 
like B2 except for decrementing by 2. Similarly 

B 3 A ■♦ B J 
"* *J2 

is like Bl. 

If we note that the numbers are changed by two in every case, 

we see that if we restrict 3, and Sg to he even numbers then the 

system (A A A B 9 A A B A, A B B B, B A B B) is isomorphic to the 

old system (Al, A2, Bl, B2) . 



I wish to acknowledge the value of discussions with A. R« Trltter 
in connection with the work reported here* 



IY~— A 2-symbol 25~state Universal machine. 



The diagram below gives the structure of a 2x25 machine obtained from 
the 4x7 machine by encoding the symbols of the latter in two-digit blocks: 
00 ; 1 01 ; X 10 ; B 11. 

The translation from the 4x7 machine is straightforward, with the same 
tape encoding. Each of the seven states is replaced by a three-state decoding 
tree. In some of these trees it was possible to encode the full quintuples, 
but in others additional states were needed. There is no reason to suppose 
this is the bfi£traaslatioaJLnto 2 symbols of 
foiir\ >fb i Ifcrp 
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